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ABSTRACT 


We measure the topology (genus) of isodensity contour surfaces in volume limited 
subsets of the 1.2 Jy IRAS redshift survey, for smoothing scales A = 4 h~^ Mpc, 

7 h~^ Mpc, and 12 h~^ Mpc. At 12 h~^ Mpc, the observed genus curve has a 
symmetric form similar to that predicted for a Gaussian random field. At the shorter 
smoothing lengths, the observed genus curve shows a modest shift in the direction 
of an isolated cluster or “meatball” topology. We use mock catalogs drawn from 
cosmological N-body simulations to investigate the systematic biases that affect 
topology measurements in samples of this size and to determine the full covariance 
matrix of the expected random errors. We incorporate the error correlations into our 
evaluations of theoretical models, obtaining both frequentist assessments of absolute 
goodness-of-fit and Bayesian assessments of models’ relative likelihoods. We compare 
the observed topology of the 1.2 Jy survey to the predictions of dynamically evolved, 
unbiased, gravitational instability models that have Gaussian initial conditions. The 
model with an n = —1, power-law initial power spectrum achieves the best overall 
agreement with the data, though models with a low-density cold dark matter power 
spectrum and an n = 0 power-law spectrum are also consistent. The observed topology 
is inconsistent with an initially Gaussian model that has n = —2, and it is strongly 
inconsistent with a Voronoi foam model, which has a non-Gaussian, bubble topology. 


Subject headings: galaxies: clustering, large-scale structure of universe 
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1. Introduction 


According to the most popular theories of structure formation, the observed distribution 
of galaxies — a complex network of clusters, superclusters, tunnels, and voids — developed by 
gravitational instability from Gaussian primordial fluctuations. Two different and complementary 
approaches have been followed to test the Gaussian hypothesis. The first uses the probability 
distribution function (PDF) or its moments (skewness, kurtosis, etc.); observed results are 
compared to predictions for a gravitationally evolved Gaussian field, which are computed either 
by numerical simulations or by various approximation schemes (e.g.. Fry 1984; Bernardeau 1992; 
Juszkiewicz, Bouchet, &: Golombi 1993; Juszkiewicz et al. 1995; Bernardeau & Kofman 1995; 
Protogeros & Scherrer 1997). The second approach uses topological characteristics of the galaxy 
density field, quantified by percolation analysis (Shandarin & Zel’dovich 1983; Yess, Shandarin & 
Fisher 1997) or by the genus of isodensity contours (Gott, Melott, & Dickinson 1986, hereafter 
GMD; Gott, Weinberg, & Melott 1987, hereafter GWM; for a review see Melott 1990). The genus 
measure also yields constraints on the index of the primordial power spectrum by quantifying the 
“corrugation” of structure in the smoothed density field. In this paper we apply the genus method 
to one of the largest complete galaxy redshift surveys, the 1.2 Jy IRAS redshift survey (Fisher et 
al. 1995). We make extensive use of mock catalogs drawn from cosmological N-body simulations 
to estimate systematic and random errors and to evaluate the viability of models. Our statistical 
methodology should also be useful for topological analyses of future, larger redshift surveys. 


Our basic approach follows that of GMD, GWM, and Gott et al. (1989). From the galaxy 
distribution, we create a density field by convolving with a Gaussian window, 


W{r) 


(27r)3/2A3 


( 1 ) 


[Note that our definition of the smoothing length, A, based on the conventional form of a Gaussian 
window, differs by a factor of 2^/^ from that used in the Gott et al. papers.] We then construct 
isodensity contours at a variety of threshold levels and measure the genus of each. Applying the 
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Gauss-Bonnet theorem, GMD define the genus Gg through the integrated Gaussian curvature, 

with K = 1 /( 0102 ) where oi and 02 are the principal radii of curvature. Equation i) differs 
slightly from the standard mathematical definition of the genus, but it is useful for cosmological 
purposes because it can be applied to a contour that runs into the boundary of a finite survey 
and it defines a quantity that is, statistically, proportional to volume. For a compact surface, 

1 -|- Gs is the number of handles or holes (in the sense of donut holes), while a surface broken 
into n disjoint, simply connected pieces (e.g., n spheres) has Gg = —n. We measure Gg using the 
program CONTOUR (Weinberg 1988), based on the algorithm of GMD (see Coles, Davies, & 
Pearson 1996 for an alternative method of computing the genus). 

For a Gaussian random field, the mean genus per unit volume is 

gg = A{1 - (3) 


where u, is the threshold density of the contour in units of the standard deviation (Doroshkevich 
1970; Adler 1981; Bardeen et al. 1986; Hamilton, Gott, & Weinberg 1986). Positive and negative 
fluctuations are statistically interchangeable in a Gaussian field, so the = 0 contour has a 
spongelike topology (positive Gg) in which the high and low density regions are both multiply 
connected and mutually interlocking. At high or low u the genus becomes negative as a typical 
contour breaks into separate bags around isolated clusters or voids, but the dependence is 
symmetric about z/ = 0. The normalizing constant A depends on the second moment of the power 
spectrum. For a field with a power-law spectrum P{k) oc A:” smoothed with the Gaussian filter of 
equation (|I|), it is 

1 p + n\3/2 
“ 47r2A3 V 6 / 

A field with more small scale power (higher n) has choppier, more corrugated structure, and hence 
a higher genus per unit volume for a given smoothing length. Since the smoothing length provides 
the only characteristic length scale in a Gaussian field with a power-law spectrum, the mean genus 
per unit volume necessarily scales as for fixed n. 
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Linear evolution preserves the Gaussian character of the initial density field. Nonlinear 
evolution does not, but the effects of nonlinear evolution on the genus curve are modest if the 
smoothing length is greater than or equal to the correlation length and one characterizes contours 
by the fractional volume they enclose rather than the density level per se (GWM; Weinberg, 
Gott & Melott 1987). Volume weighting also makes the genus curve insensitive to “biased” 
galaxy formation, since even nonlinear bias tends to maintain a monotonic relation between 
galaxy density and mass density. The information “lost” by volume weighting is precisely that 
contained in the PDF, so with this approach the genus curve and PDF at a given smoothing scale 
provide independent and complementary information about the density field. For convenience, 
we characterize a contour that encloses fractional volume / (in the region above the threshold 
density) by the value of for a corresponding contour in a Gaussian field, defined through the 
implicit equation 

/ = (27r)“^/^ f e~^^l‘^dx. (5) 

J U 

With this definition, equation (|^ continues to hold for a Gaussian field, and it remains a good 
first approximation as the field evolves into the nonlinear regime (Melott, Weinberg, & Gott 1988; 
Park & Gott 1991). Using second-order perturbation theory, Matsubara (1994) and Matsubara & 
Suto (1996) show that even weakly non-linear evolution distorts the shape of the genus curve if it 
is plotted as a function of density contrast rather than fractional volume or equivalent v. 

The genus statistic has been applied previously to six different redshift surveys of optically 
selected galaxies (Gott et al. 1989; Park, Gott, &: da Costa 1992; Vogeley et al. 1994), and to the 
QDOT survey, a l-in-6 subset of IRAS galaxies with 60^ flux density brighter than 0.6 Jy (Moore 
et al. 1992). It has also been applied to redshift surveys of Abell clusters (Gott et al. 1989; 
Rhoads, Gott, & Postman 1994), and its 2-dimensional analog has been applied to redshift slices 
(Park et al. 1992; Colley 1996), to projected galaxy and cluster catalogs (Coles h Plionis 1991; 
Plionis, Valdardini, & Coles 1992; Gott et al. 1992), and to CORE maps of cosmic microwave 
background fluctuations (Smoot et al. 1994; Colley, Gott, &: Park 1996; Kogut et al. 1996). 


The 1.2 Jy IRAS survey contains 5321 galaxies and covers all of the sky except for the 
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Galactic plane (Galactic latitude \b\ < 5°) and a number of small, isolated patches at high Galactic 
latitude, covering about 4% of the sky in total. For our analysis we use a catalog provided by 
M. Strauss in which these high latitude regions have been filled with randomly placed galaxies. 
Observational details of the survey are described in Strauss et al. (1992) and Fisher et al. (1995). 
The survey has been the basis for many statistical investigations of large scale structure including 
the power spectrum (Fisher et al. 1993; Gole, Fisher, & Weinberg 1995), the two-point correlation 
function (Fisher et al. 1994ab), and moments of the counts-in-cells distribution (Bouchet et al. 
1993). It has also been used in comparisons between predicted and observed peculiar velocity 
fields (e.g., Davis, Nusser, & Willick 1996). 

For topological analysis we use volume limited subsets of the redshift survey, so that the 
physical properties of the tracer galaxies and the effects of shot noise are uniform throughout the 
survey volume. A volume limited sample consists of those galaxies within distance i?max that are 
luminous enough that they would still exceed the survey flux limit if they were at distance i?max- 
The number of galaxies in a volume limited sample first increases with i?max as the survey volume 
grows, then declines at large i?max as the fraction of galaxies luminous enough to be seen at the 
sample edge begins to decline rapidly. For the 1.2 Jy survey, the size of a volume limited sample 
peaks at ~ 1100 galaxies for iimax ~ 60 h~^ Mpc.^ We infer a galaxy’s distance from its redshift 
referred to the frame of the Local Group. 

The smoothing length must be large enough to suppress shot noise fluctuations in the density 
field, but at fixed i?max increasing the smoothing length reduces the number of independent 
resolution elements in the survey volume. Following the rule of thumb suggested by Weinberg et 
al. (1987) and used in subsequent observational analyses, we adopt a smoothing length A d(^/2, 
where d = Ug is the mean intergalaxy separation. (The \/2 factor does not appear in the earlier 
papers because of their different Gaussian filter definition.) We discuss this choice further in §3 
below. The number of resolution elements in the smoothed density field, i.e., the ratio of the 


^/i = Ffo/lOOkms ^ Mpc 
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survey volume to the smoothing volume, is 


where oog 


Nr^.g = 


iO 


r3 


(27r)^/^A^ ’ 


47r(l — sin sr is the solid angle of the 1.2 Jy survey. 


( 6 ) 


Figure 1 plots the mean separation d and the number of resolution elements A^resj computed 
from equation ^ assuming A = dj\/2, as a function of the sample depth R^^sx- The number of 
resolution elements (which is proportional to the number of galaxies in the volume limited sample) 
peaks at ~ 170 for i?max = 60 h~^ Mpc, where the mean separation is d = 9.5 h~^ Mpc. In this 
sense, the i?max = 60 h~^ Mpc sample is the optimal volume limited subset that we can construct 
for topological analysis, and we focus our greatest attention on this sample. The corresponding 
smoothing scale is A « 7 h~^ Mpc. Since the dependence of topology on smoothing scale is itself 
interesting, we also analyze samples with i?max = 30 h~^ Mpc and 100 h~^ Mpc, using smoothing 
scales A = 4 h~^ Mpc and 12 h~^ Mpc, respectively. 


In the next section we describe our procedure for creating mock catalogs designed to mimic 
the 1.2 Jy redshift survey. In §3 we use these mock catalogs to study the systematic distortions 
in the genus curve that arise from shot noise, the finite size of the survey volume, and peculiar 
velocities. In §4 we use the mock catalogs to examine the magnitude and covariance of random 
errors in the genus curves of the 1.2 Jy subsamples, and we outline a statistical methodology for 
comparing observed and predicted genus curves. In §5 we present results for the 1.2 Jy survey and 
compare them to theoretical predictions. We summarize our conclusions in §6. 


2. Construction of Mock Catalogs 

In order to assess uncertainties in our observational analysis and generate theoretical 
predictions for comparison to the data, we want realistic mock catalogs that will be subject to the 
same systematic effects as the 1.2 Jy survey. For our primary set of mock catalogs, we use the 
same N-body simulations as Cole et al. (1995). These simulations have Gaussian initial conditions 
with the F = 0.25 power spectrum of Efstathiou, Bond &: White (1992), which produces large scale 





Fig. 1.— The mean separation d (solid line, left axis scale) and the number of resolution elements 
A^res (dotted line, right axis scale) for volume limited samples of depth i?max- Distances are in 
h~^ Mpc. 
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clustering consistent with recent studies of IRAS galaxies. The linear theory power spectrum is 
normalized to an rms fluctuation cjg = 0.8 in spheres of radius 8 h~^ Mpc at z = 0, and the density 
parameter is Qq = 0.3 (with no cosmological constant). There are four independent realizations, 
each of a periodic box of size /box = 400 h~^ Mpc. The simulations use a staggered-mesh PM code 
written by C. Park (1990) to evolve a density field represented by 200^ particles, with a 400^ mesh 
for force computations. The simulations begin at a redshift of 24 and evolve to the present in 48 
equal steps, using the expansion factor a as the time variable for integration. The large timesteps 
are adequate because of the rather low (~ 1 — 2 Mpc) force resolution of the computations. 
Because we are interested in properties of the galaxy density field smoothed over several h~^ Mpc, 
this force resolution is more than sufficient for our purposes. 

To create a mock catalog from the simulation, we first select a random particle to represent 
the “observer.” The Local Group is known to be in a region where shear and dispersion in the 
peculiar velocity field are fairly low. We therefore reject observer particles if the 3—dimensional 
peculiar velocity dispersion within a surrounding sphere of radius 5 h~^ Mpc exceeds 200 kms“^, 
thus avoiding observers in rich clusters and other regions where peculiar velocity distortions would 
be radically different from those affecting samples centered on the Local Group. Given an observer 
that passes this velocity dispersion cut, we select particles in a surrounding sphere of radius 
Rmax- We compute redshift space positions r = Hq{v — vq) • r, where v is the particle velocity 
(including Hubble flow relative to the observer) and vq is the average velocity of all particles 
within 1 h~^ Mpc of the observer, thus mimicking the procedure of referring galaxy redshifts to 
the Local Group frame. We randomly sample the particle distribution within the sphere to obtain 
a mean interparticle separation equal to that of the corresponding IRAS sample: d = 9.5 h~^ Mpc 
for Rmax = 60 h~^ Mpc, d = 5.5 h~^ Mpc for Rmax = 30 h~^ Mpc, and d = 17.0 h~^ Mpc 
for Rmax = 100 h~^ Mpc. Finally, we eliminate particles in a 10-degree wedge to represent the 
Galactic plane cut in the 1.2 Jy survey. We create 512 mock catalogs for each value of i?max, 
drawing 128 from each of the four simulations. The ratio of the volume of an individual simulation 
to the volume of an individual mock catalog is 77 x (60 h~^ Mpc/Rmax)^; so in general these 128 
mock catalogs are largely but not completely independent. 
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The r = 0.25 power spectrum has the shape predicted for a cold dark matter (CDM) model 
with scale-invariant primeval fluctuations, Hq = 0-3, and a Hubble constant h ~ 0.8 (Efstathiou 
et al. 1992). We therefore refer to these simulations below as the CDM simulations. In order 
to test Gaussian models with other initial power spectra, we also construct mock catalogs from 
simulations with initial spectra P{k) oc with n = 0, —1, and —2. We again assume Dq = 0.3, 
normalize the linear power spectra to erg = 0.8, and run four realizations of each model. The 
numerical parameters of these simulations are similar to those of the CDM simulations, except 
that they use 150^ particles, a 300^ mesh, and a cube of size 300 h~^ Mpc. In all of our models we 
choose a random subset of N-body particles to represent galaxies, thus implicitly assuming that 
galaxies are unbiased tracers of the mass. 


3. Systematic Effects 

When measuring the genus curve of a mock catalog or of a volume limited subset of the 1.2 
Jy survey, we first compute the galaxy density field on a cubic mesh, using cloud-in-cell (CIC) 
binning. We convert this density field pg to a density contrast field 6g = {ug — rig)/rig, with 
rig = Ng/Vs the mean galaxy density of the sample. We set = 0 in all mesh cells outside the 
sample volume, i.e., with R > i?max or within 5° of the Galactic plane. We also create a “mask” 
array that is 1.0 for all cells within the sample volume and 0.0 for all exterior cells. We smooth 
the galaxy density contrast field by convolving it with the Gaussian window function (|I|). At each 
cell, we want the convolution to cover only that portion of the smoothing window that lies within 
the sample volume. Technically, we accomplish this objective by smoothing both the density 
contrast array and the mask array with a Fast Fourier Transform convolution, then dividing the 
smoothed density contrast by the smoothed mask. Since = 0 outside the sample boundary, the 
exterior regions do not contribute to the convolved density contrast, and dividing by the smoothed 
mask provides the necessary volume normalization. This is the smoothing procedure advocated 
by Melott &: Dominik (1993), who tested a variety of schemes for defining smoothed density fields 
from finite samples. 
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When we apply CONTOUR to measure the topology of isodensity surfaces in this smoothed 
field, we sum the Gaussian curvature only over those vertices whose surrounding cells all he within 
the sample volume (Weinberg 1988; Gott et ah 1989). We compute the genus at 19 values of v, 
ranging from —2.5 to 2.5, with v defined in terms of the contour’s enclosed fractional volume by 
equation (^). In order to reduce noise in the genus curve, we set Gs{y) equal to the mean of the 
five genus values measured sA, v — 0.05, v — 0.025, v, v + 0.025, and v + 0.05; this is similar to the 
boxcar local smoothing used by Vogeley et al. (1994). 

If we could measure the topology of the galaxy distribution from perfect data in very large 
volumes, then the genus curve 5 s(z^), where gg = GsjV is the genus per unit volume, would 
approach a global average that would be independent of the details of the data sample. The genus 
curve measured in a limited volume will differ from this global mean genus curve in part because of 
random statistical fluctuations, which would average to zero in the analysis of many independent, 
equivalent samples of the same size. However, there are also effects that cause the genus curves 
measured from finite galaxy redshift samples to differ systematically from the true global mean. 
The first is discreteness error, which arises because our input galaxy distribution is a series of 
Dirac (5-functions rather than a continuous field. Our adopted smoothing criterion, A ~ d/\/2, 
ensures enough smoothing to suppress strong discreteness distortion (Weinberg et al. 1987), but 
the topology of the smoothed density field may still differ from that which would be obtained 
by starting from a more densely sampled galaxy distribution, with d <C \/2A. A second class 
of systematic error arises from the hnite volume itself. Contours run into the sample boundary, 
so some of the holes or isolated pieces of the contour are not fully contained within the sample. 
The CONTOUR algorithm can count “fractional holes” because it sums the Gaussian curvature 
K only over vertices contained within the sample, but the values of K along the boundary are 
correlated, and there can be systematic biases whenever the summed curvature along the boundary 
is a significant fraction of the summed curvature in the interior. Another finite volume error arises 
because we plot gs against v defined by equation (|5|) using the fractional volume / enclosed by 
the contour within the survey region. In general this is not the same as the global value of / at 
the same threshold density, though it should be close if the survey volume is large enough to be 
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statistically representative. A third systematic error arises because we set the density contrast 
outside the sample region to zero before smoothing instead of to the true (but unknown) density 
contrast, so that cells within one or two smoothing lengths of the boundary do not have the 
correct smoothed density value. Finally, peculiar velocities can distort the genus curve because 
we compute galaxy positions from redshifts rather than true distances. Melott et al. (1988) use 
numerical simulations to argue that peculiar velocity effects are small, and Matsubara (1996) 
demonstrates this point analytically using linear perturbation theory. 

We will compare our observational results for the 1.2 Jy data to theoretical predictions based 
on mock catalogs that are analyzed in the same fashion as the data. This approach allows a fair 
test of theoretical models regardless of the systematic effects, but it is nonetheless valuable to 
know just how these influence the observed genus curve. We can judge this by comparing the 
average mock catalog results to those obtained from the full, periodic, densely sampled simulation 
cubes, which constitute effectively perfect data. 

Figure illustrates this comparison for the i?max = 60 h~^ Mpc data sample using the low-fl 
CDM simulations described in §2, which have a power spectrum similar to that observed for 
IRAS galaxies (Fisher et al. 1993). The solid lines show the genus curves measured from the full, 
densely sampled simulation cubes, with A = 7 h~^ Mpc (left) and A = 9 h~^ Mpc (right). We 
average results from the four 400 h~^ Mpc cubes, and we plot 47r^A^gs, where gg is the genus 
per unit volume, so that the expected amplitude of the curve is of order unity and independent 
of the simulation volume itself (see eqs. and |Q]). These curves, free of systematic errors, 
are labeled Eq. The dotted curves are measured from the full, periodic cubes after the galaxy 
distribution is randomly sampled to the mean density of the 1.2 Jy, J?max = 60 h~^ Mpc sample, 
fig = 1.17 X 10~^h^ Mpc“^. These curves, labeled Ei, contain discreteness error but no other 
systematic biases. Comparing to the solid curves, we see that discreteness raises the amplitude of 
the genus curve and shifts it to the left, but with A = 7 or 9 h~^ Mpc the impact is small. Next we 
take the smoothed held created from the sampled particle distribution in the periodic cubes, but 
we measure the topology in volumes that have the size and geometry of the 1.2 Jy sample, i.e.. 
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Fig. 2.— The influence of systematic biases on the measured genus curve, for smoothing lengths 
A = 7 h~^ Mpc (left) and A = 9 h~^ Mpc (right) and sample geometry and mean galaxy density 
equivalent to those of the i?max = 60 h~^ Mpc volume limited subset of the 1.2 Jy survey. Solid 
lines show genus curves measured from the effectively “perfect” theoretical data provided by our 
four CDM simulations, each of which is a densely sampled, 400 h~^ Mpc, periodic cube. Dotted, 
short-dashed, long-dashed, and dot-dashed curves incorporate, successively, discreteness effects, 
finite volume effects, boundary smoothing effects, and peculiar velocity effects, as discussed in the 


text. 
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60 h~^ Mpc spheres with 10° wedges removed. Averaging over 512 such volumes, we obtain the 
dashed curves in Figure labeled Ei + E 2 because they include both discreteness error and finite 
volume effects. The latter amplify the genus curve substantially at all values of v. However, when 
we smooth using only the sample volume itself, i.e., carry out our full observational procedure but 
on mock catalogs that use true distances instead of redshifts, we obtain the long-dashed curves 
El + E 2 + E^, which are much closer to the true genus curves Eq. It thus appears that the finite 
volume and boundary smoothing errors largely cancel each other. Finally, the dot-dashed curve, 
labeled StotaU shows the average genus curve measured from the 512 mock catalogs in redshift 
space. It is similar to Ei + E 2 + E'i, indicating, as expected from earlier studies, that peculiar 
velocities have little effect on the genus curve. 

As we shall soon see, the difference between the Eq and Stotai curves in Figure ^ is small 
compared to the random statistical fluctuations expected in a single volume the size of our 1.2 
Jy sample. In this sense, the cumulative effect of the systematic errors that we have described is 
small, though it is disconcerting to see that this small cumulative impact reflects a cancellation 
between two types of errors (finite volume and boundary smoothing) that appear to be quite 
substantial individually. After a number of tests, we remain somewhat puzzled about the nature 
of the finite volume effect and the reason for its cancellation by the boundary smoothing effect. 
The systematic amplification of the genus curve in a finite volume occurs for cubical masks and 
spherical masks as well as for our IRAS masks (spheres with missing wedges), and it occurs for 
Gaussian random fields as well as for N-body models. It gradually disappears as the survey volume 
becomes large (compared to the smoothing volume), presumably because “fractional” holes then 
make a small contribution to a contour’s total genus. Note that the difference between locally and 
globally defined values of v (e.g., the fact that = 0 corresponds to the median density within 
the sample instead of the true median density) can alter the shape of the genus curve but cannot 
produce an overall amplification, so it is not responsible for the effect seen here. 

The systematic biases illustrated in Figure ^ are not significantly larger for A = 7 h~^ Mpc 
than for A = 9 h~^ Mpc. Even though the larger smoothing length suppresses discreteness 
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more effectively, it leads to larger finite volume effects because the number of independent 
structures within a sample volume is smaller. The random statistical fluctuations should be 
smaller with \ = 7 h~^ Mpc for the same reason, so we adopt this smoothing length for the 
1.2 Jy, i?max = 60 h~^ Mpc sample. With a much smaller smoothing length, discreteness 
effects would become excessive. We have carried out analysis similar to that in Figure ^ for 
our iimax = 30 h~^ Mpc and i?max = 100 h~^ Mpc samples, with A « djypi = 4 h~^ Mpc and 
12 h~^ Mpc, respectively. The qualitative results are similar, although the systematic biases are 
somewhat stronger for these sample volumes because of the smaller numbers of resolution elements 
(see Figure |l]). 


4. Random Errors and Statistical Methodology 


We ultimately wish to use the measured topology of structure in the 1.2 Jy survey to evaluate 
the viability of theoretical models for the origin of this structure. By using mock catalogs, we can 
ensure that the theoretical predictions incorporate the same systematic biases that influence the 
observational data. We are then faced with the task of deciding whether the model predictions 
are consistent with the data to within the expected random errors, and whether the data favor 
one theoretical model over another. These evaluations will have the maximum statistical power if 
they are based on the likelihood C = P(Data|Model), the conditional probability of obtaining the 
observed data given the assumed model. 


If the random errors in the values of Gs measured at N different values of v were independent 


and Gaussian distributed, then the likelihood would be 

1 ^ 


C = exp(--x^i^g) Y[ {2TTaf) 


2N-1/2 


2 = 1 


where 


2 ^ 

Xdiag — ^ 2 


( 7 ) 


( 8 ) 


2=1 


=;Model 


and Gg {vi) and af are respectively the mean and variance of the genus values obtained from 


the mock catalogs. In a “frequentist” statistical analysis, one typically evaluates the acceptability 
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of a model by asking whether its x^i^g “reasonable”, i.e., whether Xdiag/-^ ^ Bayesian 

analysis, the ratio of likelihoods for two models tells one how to update the relative assessment of 
these models in light of the new data, since the ratio of models’ posterior probabilities is equal to 
the ratio of their prior probabilities multiplied by their likelihood ratio. 

The statistical evaluation of topology data is complicated by the fact that random errors 
at different values of v are not independent. Error correlations arise because a given volume 
contains only one set of structures — changing in a given volume is not equivalent to changing 
V and simultaneously moving to a different region of the universe to obtain an independent genus 
measurement. The genus curve of an individual mock catalog is typically shifted or amplified 
coherently relative to the average model prediction. Figure shows the covariance matrix, 

Q, = , (9) 

computed from 512 mock catalogs of the i?max = 60 h~^ Mpc sample drawn from the CDM 
simulations, with A = 7 h~^ Mpc. Circles and squares represent positive and negative values of Cij, 
respectively, and the area of the symbol shows the magnitude \Cij\. The matrix is approximately 
diagonal in the sense that the largest element in any row Vj is the variance Cjj = (t|, but there are 
significant correlations between the errors at neighboring values of v, and there are anticorrelations 
for \i/j — t'il ~ 1, reflecting the coherent shifts of genus curves mentioned above. The errors are 
largest at « 0, but this is mainly because the genus curve itself peaks here. 

Even though the errors in the genus curve are not independent, we can adopt the working 
hypothesis that the error distribution is a multivariate Gaussian. In this case the likelihood is 

£ = exp(-lx2^, (10) 

where 

N 

( 11 ) 

hi 

and C~j^ is the inverse of the covariance matrix defined in equation (|9|). If the errors were 
independent, then the covariance matrix would be diagonal, Cij = Sijaf, and equation ( 0 ) would 
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Fig. 3.— The covariance matrix Cij (eq. |^) obtained from the CDM mock catalogs with 
7?max = 60 h~^ Mpc and A = 7 h~^ Mpc. The area of the symbol plotted at r',, Vj is proportional to 
the magnitude of Cij (see scale at right), with circles and squares representing positive and negative 
matrix elements, respectively. 
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Fig. 4.— The inverse covariance matrix for the CDM mock catalogs, in the same format as 

Fig. § 
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reduce to equation (|^). To the extent that the error distribution is indeed multivariate Gaussian, 
the quantity should follow a x^ distribution with N degrees of freedom, so we can still use 

the criterion ^ 1 as a frequentist evaluation of a model’s success. Even if the multivariate 

Gaussian assumption does not hold perfectly, x^ still provides a useful goodness-of-fit measure, 
and we can use the mock catalogs to derive its distribution empirically. With the Gaussian-error 
approximation, we can also use equation dD to compute likelihood ratios for comparisons between 
models. 

Figure shows the inverse covariance matrix corresponding to the covariance matrix in 
Figure The main impact of the correlations in Cij is to introduce negative terms immediately 
off the diagonal in C~j^. From equation (pT[) we see that these negative off-diagonal terms mean 
that deviations of the same sign at neighboring values of v are “penalized” in the likelihood less 
strongly than they would be if we ignored the error correlations by using equation (^). 

While our method of treating correlated errors has not been used in previous topology 
analyses, it is similar to the approach used by Fisher et ah (1994b) and Gole et al. (1995) in their 
studies of anisotropic redshift-space clustering in the 1.2 Jy survey. 


5. Results 

5.1. Topology of the 1.2 Jy Survey 

As mentioned earlier, we perform the topology analysis at three different smoothing scales, 

A = 4, 7, and 12 h~^ Mpc. For each scale, we use a volume limited subset of the 1.2 Jy survey 
with outer radius Rmax chosen so that the mean intergalaxy separation is d « A\/2. The sample 
radii are i?max = 30, 60, and 100 h~^ Mpc, respectively. Since each sample is at least 4.6 times 
the volume of the preceding one, and the smoothing scales themselves differ by factors ~ 1.7, the 
genus curves obtained from these three samples are effectively independent. 

The i?max = 60 h~^ Mpc sample has the largest number of resolution elements, as shown in 
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Fig. 5.— Genus curve of the i?max = 60 h~^ Mpc, volume limited subset of the IRAS 1.2 Jy survey 
(solid line), with a smoothing length \ = 7 h~^ Mpc. Error bars (Icr) are computed from the 
CDM mock catalogs. The dotted line shows the genus curve expected for a Gaussian random field, 
eq. (i, with amplitude chosen by minimization as described in the text. 






















Fig. 6.— Like Fig. |5[ but for the i?max = 30 /i ^ Mpc sample and smoothing length A = 4 /i ^ Mpc. 
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Fig. 7.— Like Fig. but for the i2max = 100 h ^ Mpc sample and smoothing length 

A = 12 h~^ Mpc. 
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Figure The solid line in Figure shows the genus curve of this sample. We attach Icj error bars 
computed from the CDM mock catalogs; they are the square-roots of the diagonal elements of 
the covariance matrix illustrated in Figure While the size of the error bars varies from one 
theoretical model to another, we have chosen a model that is known to reproduce other clustering 
properties of galaxies in the 1.2 Jy survey fairly well. Figures ^ and ^ show the genus curves of 
the 30 h~^ Mpc and 100 h~^ Mpc samples, respectively, with error bars computed from the CDM 
mock catalogs of these samples. 

We will conduct a detailed comparison to theoretical models in the next section, but as a guide 
in Figures M we show a genus curve with the form (|^ predicted for a Gaussian random field. 
In each case we choose the amplitude A by minimizing using the covariance matrix computed 
from the CDM mock catalogs. (A better way of choosing the amplitude would be to use the 
covariance matrix for the Gaussian random field models, but as we do not have mock catalogs for 
those we do not know the covariance matrix.) The fitted amplitudes are A = 5.5, 3.6, and 12.0 for 
Rma.x = 60, 30, and 100 h~^ Mpc, respectively, corresponding (through eq. Q) to effective power 
spectral indices neo = —1.9, n-so = —1.9, and nioo = —1.0. At smoothing lengths A = 4 h~^ Mpc 
and X = 7 h~^ Mpc, the observed genus curves are shifted to the left relative to the best fitting 
Gaussian field predictions. This shift in the direction of an isolated cluster or “meatball” topology 
has also been seen in a number of other data samples at similar smoothing lengths (Gott et al. 
1989; Moore et al. 1992; Park et al. 1992). Yess et al. (1996) also find evidence for a “meatball” 
topology in the 1.2 Jy survey from percolation analysis. At A = 12 h~^ Mpc, the observed genus 
curve is symmetric and similar in form to the Gaussian field prediction. Visual examination with 
the plotted error bars suggests that the observed genus curve disagrees significantly with the 
Gaussian field curve for A = 4 h~^ Mpc, is marginally compatible with it for A = 7 h~^ Mpc, and 
matches it well for A = 12 h~^ Mpc. This impression is borne out by the values, which are 34.0, 
21.2, and 15.3 for the three smoothing lengths, respectively, with 18 degrees of freedom (19 data 
points less one free fitting parameter). These values should not be taken too literally, since 
they are all computed using the GDM covariance matrix. Still more important, the Gaussian field 
predictions do not include any effects of nonlinear gravitational evolution, and they do not include 
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the systematic biases discussed in §3. 


5.2. Comparison to Models 

Our goal in this section is to test dynamically evolved models with Gaussian initial conditions 
against the observed topology of the 1.2 Jy survey. We will also consider a simple example of a 
model with a non-Gaussian topology. As discussed in §2, we have performed N-body simulations 
starting from Gaussian initial conditions with a F = 0.25 CDM spectrum and power-law spectra 
with n = 0, n = —1, and n = —2. In all models we assume that Oq = 0.3 and that galaxies are 
unbiased tracers of the mass distribution. For each of the models, we have used these simulations 
to construct 512 mock 1.2 Jy catalogs at each value of i?max 5 and from these we compute the mean 
predicted genus curve and the covariance matrix of the random errors. 

As an example, Figure ^ shows, in the left hand panel, the mean genus curve of the GDM mock 
catalogs (solid line) for A = 7 h~^ Mpc, Rmax = 60 h~^ Mpc. Error bars show the dispersion of the 
mock catalog genus values at each value of v, they are the square-roots of the diagonal elements of 
the covariance matrix. The dotted line shows the observed genus curve of the i?max = 60 h~^ Mpc 
sample, repeated from Figure |^. Visual examination suggests that the GDM model and the IRAS 
data agree fairly well given the size of the Icr error bars, though a “chi-by-eye” cannot easily take 
error correlations into account. 

The value of from this comparison, using the CDM covariance matrix and equation ([Il|), is 
= 24.0. We can use this value to obtain a “frequentist” measure of the goodness-of-fit between 
the CDM prediction and the observations. Since there are no free parameters chosen to fit the 
data, the number of degrees of freedom is N = 19, one for each data point. If the CDM model 
were correct and the distribution of errors were truly a multivariate Gaussian, the probability 
of getting a value this large or larger would just be the integral of the x^ distribution for N 
degrees of freedom: 

roo 

Fg{x^-N) = / git:N)dt 
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Fig. 8.— Comparison between the genus curve of the 1.2 Jy data (i?max = 60 h~^ Mpc sample, 
A = 7 h~^ Mpc) and the predictions of the low-fl CDM model. The left panel shows the observed 
genus curve (dotted line) and the mean genus curve of the 512 mock catalogs drawn from the 
CDM simulation (solid line). Error bars show the la dispersion of the mock catalog results. The 
value computed using the model covariance matrix is x^ = 24.0. In the right hand panel, 
the solid histogram shows the distribution /(x^) of mock catalog x^ values relative to the mean 
CDM prediction. The dotted line shows the corresponding cumulative distribution F{y^). The 
filled circle at x^ = 24.0, F(x^) = 0.184 indicates that 18.4% of mock catalogs in a CDM universe 
have a x^ value larger than that obtained for the 1.2 Jy data. The smooth solid curve shows the 
distribution g{x^ ■ 19) expected for the case of a multivariate Gaussian error distribution. 
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where we use the subscript G on Fq to denote the Gaussian-error assumption. However, since we 
have a large number of mock catalogs available to us, we do not have to rely on equation (^). 
Instead, we can treat a statistic motivated by the Gaussian-error assumption but calibrate its 
distribution directly using the mock catalogs. The histogram in the right hand panel of Figure ^ 
shows the distribution f{x^), the fraction of the GDM mock catalogs that produce this value 
when compared to the mean CDM genus curve, in bins of width Ax^ = 1. The dotted curve shows 
the corresponding cumulative distribution F(x^), the integral of f{x^)- A heavy point is plotted 
at the value F{x^) = 0.184 corresponding to the observed value of x^ = 24.0. We thus see that 
in the GDM model universe, 18.4% of random observers would find a discrepancy with the mean 
GDM genus curve that is as large as or larger than that found for the 1.2 Jy survey. We conclude 
that the GDM model does indeed yield acceptable agreement (at the ~ la level) with the observed 
topology of this data sample for A = 7 h~^ Mpc. 


The smooth curve in the right hand panel of Figure shows g(x^ ■ 19)) a distribution 
with 19 degrees of freedom. This curve tracks the mock catalog histogram /(x^) remarkably 
well, implying that the assumption of a multivariate Gaussian error distribution is indeed a good 
approximation for these purposes. The value Fg(24.0 : 19) = 0.196 obtained from equation (^) 
is close to the value F(x^) = 0.184 obtained from the mock catalogs. We have carried out similar 
comparisons for our other models and other sample radii. We find that the agreement between 
the mock catalog f(x^) and g(x^ : 19) holds quite well in most cases, though the mock catalog 
distributions tend to have somewhat longer tails towards high x^j so assuming Gaussian errors 
tends to underestimate the probability of the most extreme events. 

Figures P, |T^, and 0 show the comparisons between our four dynamically evolved, initially 
Gaussian models and the 1.2 Jy data, for (A, Rmax) = (7,60), (4,30), and (12,100), respectively. In 
each panel we show the mean predicted genus curve as the solid line with la error bars and the 
observed genus curve as the dotted line. We also list the value of x^ obtained using the model 
covariance matrix and the fraction F(x^) of mock catalogs that produce values of x^ at least this 
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Fig. 9.— Comparison between the observed genus curve of the 1.2 Jy sample for A = 7 h~^ Mpc 
(-Rmax = 60 h~^ Mpc) and the predictions of the four N-body models, which assume Gaussian 
initial conditions with CDM, n = 0, n = — 1, and n = —2 initial power spectra. In each panel, the 
dotted line shows the observed genus curve, and the solid line with Icj error bars shows the mean 
model prediction computed from the mock catalogs. The values of and T(x^) are listed in each 
panel. 
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Fig. 10. — Same as Fig. ^ but for a smoothing scale A = 4 /i ^Mpc (i?max = 30 /i ^ Mpc). 
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Fig. 11.— Same as Fig. ^ but for a smoothing scale X = 12 h ^Mpc (i?max = 100 h ^ Mpc). 
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large when compared to the mean model prediction. 

For A = 7 h~^ Mpc (Fig. |^, the CDM and n = — 1 models give similar results, both in 
reasonable agreement with the observed genus curve. The n = 0 model predicts a higher amplitude 
genus curve, as expected given the greater amount of small-scale power in its initial conditions 
(see eq. |Q]). However, this model also predicts larger error bars than the CDM or n = — 1 models, 
and its of 23.8 is similar to the CDM = 24.0. A discrepancy of x^ ^ 23.8 is found for 
F{x^) = 22.3% of the n = 0 mock catalogs. The n = —2 model predicts the lowest amplitude 
genus curve, and it also has x^ = 24.0, with F{x^) = 18.0%. At A = 7 h~^ Mpc, all four models 
predict an approximately symmetric genus curve, with a small asymmetry between the topology 
of high and low density regions that reflects the combination of nonlinear gravitational evolution 
and systematic biases in the topology measurements. 

For A = 4 h~^ Mpc (Fig. 0), the amplitude of the CDM predicted genus curve is lower than 
that of the observed genus curve. The comparison yields x^ = 30.4, F{x^) = 0.088, indicating 
only marginal compatibility between the model and the data. The n = 0 model predicts a higher 
amplitude curve that agrees fairly well with the observations. The n = — 1 prediction is similar to 
the CDM prediction, but it is different enough to yield better quantitative agreement with the 
1.2 Jy data. The n = —2 model predicts a genus curve whose amplitude is too low. It is strongly 
contradicted by the data, with x^ = 38.1 and F{x^) = 0.02. At this smoothing length, all four 
models predict a genus curve that is shifted slightly in the direction of a “meatball” topology, but 
in all cases the shift is smaller than that seen in the 1.2 Jy genus curve. 

At A = 12 h~^ Mpc (Fig. 0), the observed genus curve is quite symmetric, and it agrees 
well with the CDM, n = 0, and n = — 1 model predictions. Indeed, the match to the n = 0 and 
n = — 1 curves is so good that in each case more than 90% of the mock catalogs have higher x^ 
than the 1.2 Jy data. The amplitude of the n = —2 genus curve is lower than observed, but for 
this smoothing length the discrepancy with the data (x^ = 21.9) is not very significant, with 
F{x^) = 29.1%. 


As a simple example of a model with a non-Gaussian topology, we consider a Voronoi foam. 
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Fig. 12.— Comparison of the observed genus curves (dotted lines) to the predictions of the Voronoi 
foam model (solid lines with Icj error bars), for the three smoothing lengths and corresponding 
sample volumes. Values of and Fij^) are listed in each panel. 
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which has often been used as a phenomenological description of the large scale structure of the 
galaxy distribution (van de Weygaert 1994 and references therein). Starting with a Poisson 
distribution of particles in a box of size 300 h~^ Mpc, we randomly distribute “seeds” with a mean 
separation ds = Us = 50 h~^ Mpc, then project each particle radially outward from the nearest 
seed until it is equidistant with the second-nearest seed. This procedure distributes the particles 
on the walls of polygonal cells, whose faces are the perpendicular bisector planes of neighboring 
seeds. The resulting “bubble” topology is similar to that of structure that evolves from initial 
conditions with a negatively skewed probability distribution (Weinberg &: Cole 1992). We generate 
four Voronoi foams of this sort and create mock catalogs and compute mean predictions and 
covariance matrices just as we have done for the N-body models. 


Figure ^ shows the comparison between the Voronoi foam model and the 1.2 Jy data for 
the three smoothing lengths. On all three scales, the genus curve of the Voronoi foam shows the 
rightward shift characteristic of a bubbly (as opposed to spongelike) distribution. In every case, 
the value is very large, greater than that in 99% or more of the mock catalogs. 


Table 1 summarizes the results of these comparisons, listing the values of s-nd F{x‘^) for 
each combination of model and smoothing length. We can directly compare the ability of these 
models to account for the topology of the 1.2 Jy data using a likelihood ratio test. For this 
purpose, we will assume that the error distribution is indeed multivariate Gaussian, so that the 


relative likelihood of obtaining the observed genus curve in models A and B is, from equation (IT), 


Ca _ 


exp 




(13) 


Because the genus curves for the three smoothing lengths are effectively independent (see §5.1), the 
overall likelihood ratio of two models is simply the product of their likelihood ratios for the three 
different samples. Note that the covariance matrix determinants enter into the likelihood ratio 
as well as the x^ values themselves: for instance, if two models have the same x^ value but one 
predicts smaller random errors, then the model that makes the tighter prediction is preferred. The 
determinant ratio factor can be quite significant, as one can easily see for the case of uncorrelated 
errors (C^- = Jijcr?). If one model predicts error bars that are consistently 20% larger at each of 
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the iV = 19 z/ values, then the determinant ratio factor is 1.2'^ ~ 32 at each smoothing length. 

The likelihood ratios (relative to the CDM model) are listed in the last column of Table 1. 
We have already seen from the test that the CDM, n = 0, and n = — 1 models are separately 
compatible with the observed topology of the three volume limited 1.2 Jy samples at the > 5% 
level. The likelihood ratios, which combine the information from all three samples, show that the 
n = — 1 model (advocated on other gronnds as early as Gott & Rees 1975) is the most successful 
overall, with a likelihood ~ 90 times higher than that of the CDM model. While the n = 0 model 
produces lower x^ values than the CDM model at each smoothing length, its overall likelihood 
is only about half that of the CDM model because of its systematically larger error bars. The 
n = —2 model is strongly disfavored by the data, with a likelihood ratio of 0.002 relative to CDM 
and 2.2 x 10“^ relative to the n = —1 model. We have also investigated power-law models with 
Do = 1 instead of Dq = 0.3 (but otherwise identical to our standard power-law models), and 
although the predicted genus curves are not radically different in the high density models, the 
statistical discrepancies with the observations are substantially larger for n = — 1 and n = — 2 and 
slightly smaller for n = 0. The Voronoi foam has a formal likelihood ratio of 4.4 x 10“^®, and while 
the assumption of Gaussian errors surely breaks down at this level, it is clear that the observed 
topology of the 1.2 Jy survey is inconsistent with the “bubble” topology of this model. 




A (Rmax) 




Model 

7 (60) 

4 (30) 

12 

(100) 


CDM 

24.0 (0.184) 

30.4 (0.088) 

12.8 

(0.854) 

1.00 

n = 0 

23.8 (0.223) 

19.3 (0.428) 

10.1 

(0.934) 

0.57 

n = —1 

18.4 (0.477) 

23.6 (0.211) 

11.4 

(0.926) 

91.8 

n = —2 

24.0 (0.180) 

38.1 (0.020) 

21.9 

(0.291) 

0.002 

Voronoi 

46.5 (0.002) 

108.6(0.002) 

38.1 

(0.008) 

4.4 X 10-26 


Table 1; Summary of comparisons between models and the observed genus curves. Columns 2, 3, 
and 4 list the values of x^ and T(x^) for smoothing lengths A = 7, 4, and 12 h~^ Mpc, respectively. 
Column 5 lists the overall likelihood of the model relative to the likelihood of the CDM model. 



6. Summary 


We have measured the topology of the galaxy distribution in the IRAS 1.2 Jy redshift 
survey using the methods of GMD, GWM, and Gott et al. (1989). We consider three volume 
limited subsets of the data, with limiting radii Rmax = 30, 60, and 100 h~^ Mpc, analyzed with 
corresponding smoothing lengths A = 4, 7, and 12 h~^ Mpc. We use mock catalogs drawn from 
cosmological N-body simulations in order to derive theoretically predicted genus curves and to 
study the systematic and random errors expected in samples of this size. Our principal conclusions 
are; 

(1) In tests on mock catalogs from low-H CDM simulations, the net systematic error in volumes 
the size of our 1.2 Jy subsamples is small compared to the random errors. However, this small net 
error involves a cancellation between the systematic effects of measuring the genus in a volume 
that contains few independent structures and the effects of smoothing only with that portion of 
the smoothing window that lies within the sample volume. 

(2) The covariance matrix of random errors in the genus curve is predominantly diagonal, but 
there are significant correlations in the errors that should be taken into account when assessing 
theoretical models. To a reasonable approximation, (including covariances) is distributed as it 
would be if the errors followed a multivariate Gaussian distribution, though extreme values of 
are more common than they would be for purely Gaussian statistics. 

(3) With A = 12 h~^ Mpc, the genus curve of the 1.2 Jy data has a symmetric form similar to that 
predicted for a Gaussian random field. For A = 7 h~^ Mpc and 4 h~^ Mpc, the observed genus 
curves are increasingly shifted in the direction of a “meatball” topology. 

(4) Taken individually, the three observed genus curves are consistent at the > 5% level with the 
topology of dynamically evolved N-body models that have Gaussian initial conditions with low-fl 
GDM (r = 0.25), n = 0, or n = — 1 power spectra. Combining all three data sets, the n = — 1 
model is the most successful overall, with a likelihood ratio of 91.8 relative to CDM and 161 
relative to n = 0. 

(5) The observed genus curves are inconsistent with an n = —2, initially Gaussian model, which 
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produces structure that is excessively coherent and, consequently, genus curves whose amplitudes 
are too low. The observed genus curves are strongly inconsistent with a Voronoi foam model, 
which, on account of its “bubble” topology, predicts genus curves that are systematically shifted 
towards higher densities, in the opposite direction from the observed shifts. 

Our conclusions about the shapes of observed genus curves and their consistency or 
inconsistency with various theoretical models are similar to those drawn from a number of other 
topological studies of the galaxy distribution (Gott et ah 1989; Moore et al. 1992; Park et al. 1992; 
Colley 1996). They are somewhat at odds with the results of Vogeley et al. (1994), who found that 
genus curves measured from the extended CfA redshift survey showed shifts in the direction of a 
bubble topology and were inconsistent with the genus curves predicted by CDM N-body models. 
The differences could reflect systematic differences in the structure traced by optical and IRAS 
galaxies, differences in the details of the topology analysis, or the somewhat greater statistical 
power of the CfA data set at the short smoothing lengths (A ~ 5 Mpc) where the differences 
are most pronounced. 

Earlier topology studies, recognizing the problem of correlated errors in the genus curve, have 
developed “meta-statistics” that characterize the overall shape of the genus curve (e.g., amplitude, 
asymmetry, width) and used these to assess the compatibility of models with the observations (see, 
e.g., Vogeley et al. 1994). Here we have taken the more direct approach of measuring the error 
covariance matrix from mock catalogs and incorporating it into model assessments. We compute 
values that include the error covariance (eq. 0) and calculate the distribution of from 
mock catalogs, in order to get an absolute, frequentist assessment of a model’s goodness-of-fit. 
The distribution of values in the mock catalogs implies that the multivariate Gaussian 
approximation describes the error distribution quite well, failing only in the extreme tails. We 
perform likelihood ratio comparisons between theoretical models to assess their relative ability to 
account for the observed topology data, making use of the Gaussian-error approximation. The 
advantages of our approach are that it has a clear statistical motivation, it provides a natural 
path for combining information from independent data samples, and, to the extent that the 
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Gaussian-error approximation holds, it makes the best possible use of the data because it is based 
directly on the likelihood. The disadvantage is that a high or low likelihood value says nothing 
in itself about how the model prediction and the data disagree. Thus, the likelihood approach 
used here is the most statistically powerful way to assess and compare models, but measures like 
those of Park et ah (1992) and Vogeley et ah (1994) may be useful for quantifying the nature 
of discrepancies between theory and observation. Our approach to handling correlated errors is 
similar to that used by Fisher et al. (1994) and Cole et al. (1995) in studies of redshift space 
distortions of the correlation function and power spectrum, and it can be adapted to many other 
problems in which error correlations are important but computable. It should be especially useful 
for topological analyses of future large galaxy redshift surveys, such as the Anglo-Australian 
2dF survey and the Sloan Digital Sky Survey, which will yield much more stringent tests of the 
hypothesis that structure in the universe formed from Gaussian primordial fluctuations. 

We are grateful to Michael Strauss and Karl Fisher for providing us with the 1.2 Jy redshift 
data in electronic form and for numerous helpful discussions on large scale structure. We thank 
Richard Gott, Adrian Melott, and Andrew Hamilton for helpful discussions on topological analysis 
of redshift surveys over the course of many years. We acknowledge support from NASA Grants 
NAG5-3111 and NAG5-2759 and NSF Grant AST95-29120. 
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